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Abstract — Adaptive techniques make practical many quantum 
measurements tliat would otherwise be beyond current labo- 
ratory capabilities. For example: they allow discrimination of 
nonorthogonal states with a probability of error equal to the 
Helstrom bound; they allow measurement of the phase of a 
quantum oscillator with accuracy approaching (or in some cases 
attaining) the Heisenberg limit; and they allow estimation of 
phase in interferometry with a variance scaling at the Heisenberg 
limit, using only single qubit measurement and control. Each of 
these examples has close links with quantum information, in 
particular experimental optical quantum information: the first 
is a basic quantum communication protocol; the second has 
potential application in linear optical quantum computing; the 
third uses an adaptive protocol inspired by the quantum phase 
estimation algorithm. We discuss each of these examples, and 
their implementation in the laboratory, but concentrate upon the 
last, which was published most recently [Higgins et al., Nature 
vol. 450, p. 393, 2007]. 

Index Terms — quantum, adaptive, measurement, interferome- 
try, optical, computing, phase, estimation, algorithm 

I. Introduction 

MEASUREMENT of a quantum system has convention- 
ally been defined in terms of an observable, represented 
by an Hermitian operator on the system's Hilbert space. How- 
ever, it is now widely recognized that many realistic measure- 
ments should not be described in such terms Rather, the 
formalism of generalized measurements, described by a set of 
positive maps, is required. Such generalized measurements are 
not only necessary for describing realistic detection; they also 
allow for interesting protocols that conventional (projective) 
measurements cannot achieve. A simple example is unambigu- 
ous (but probabilistic) state-discrimination for nonorthogonal 
states 121, E]. 

A powerful way to generate interesting generalized mea- 
surements from available detectors in the laboratory is by 
adaptive measurement protocols. By this we mean the follow- 
ing: An incomplete measurement is made on the system, and 
its result used to choose the nature of the second measurement 
made on the system, and so on (until the measurement is 
complete). A complete measurement is one which leaves the 
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system in a state independent of its initial state, and hence con- 
taining no further information of use [ 1 J . A measurement may 
be incomplete by being a weak (non-projective) measurement 
on the system as a whole; or by being a strong (projective) 
measurement but only on a subsystem; or in other ways. All 
types allow for adaptive protocols. 

Adaptive measurements connect to quantum information, 
in particular experimental quantum optical information, in a 
number of ways. We will review three examples in Sec. II: 
distinguishing non-orthogonal states; optical phase measure- 
ment; and interferometric phase estimation. The first is a basic 
protocol in quantum communication, the second has potential 
applications in linear optics quantum computing, and the third 
has been inspired by quantum computing algorithm theory. All 
three have been realized in the laboratory in recent years ID, 
||5l . 0. The last, most recent, of these is analysed in detail 
from a quantum information perspective in the remaining 
Sections of the paper, which cover: quantum limits to phase 
estimation (Sec. Ill); the Quantum Phase Estimation Algorithm 
(Sec. IV), and its generalization (Sec. V); and whether adaptive 
measurements are necessary in this context (Sec. VI). 

II. Applications for Adaptive Measurements 
A. Distinguisliing Non-Ortliogonal States 

The idea of using adaptive measurements for discriminating 
between two non-orthorgonal quantum states was introduced 
as early as 1973 |7|. "Dolinar's receiver" is an optical tech- 
nique, applicable to a traveling mode prepared in one of two 
possible coherent states. The object is to discriminate the 
preparations with minimum probability of error The minimum 
possible error probability is known as the Helstrom bound |8|, 
which in this case is (l/4)e-l^"l' , where Aa is the difference 
between the coherent amplitudes of the two states. In this 
case the Helstrom measurement could be realized simply by 
measuring a suitable observable on the harmonic oscillator 
Hilbert space. However, this observable does not correspond 
to any of the observables usually measured in quantum optics, 
such as a quadrature, or a displaced photon number operator 
Indeed, the obvious scheme of measuring the quadrature Xg, 
with 6 = arg(Aa), gives a probability of error of scaling as 
the square root of the Helstrom bound for large |Aa| [7J- 

Surprisingly, by using adaptive detection, one can precisely 
achieve the Helstrom bound [7 1. One must measure the leading 
segment of the pulse, obtain a result, and use that result to 
alter the measurement on the next segment of the pulse, and 
so on. The Dolinar receiver requires taking the continuous 
time limit for these segments of pulse, but reasonable results 
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can be obtained as long as the pulse mode has a duration 
long compared to the delay in the feedback loop |9|. The 
Dolinar scheme requires measuring a displaced photon number 
operator (using a weak local oscillator and a photon counter), 
and altering the displacement whenever a photodetection oc- 
curs. Very precise control over the applied displacements is 
required, and very fast electro-optics. For these reasons the 
"Dolinar receiver" was not realized experimentally until 2007, 
by Geremia and co-workers |4|. This experiment showed 
clearly the improvement over the most obvious non-adaptive 
technique for mean photon numbers between 0.1 and 1. 

The Dolinar receiver is most naturally and simply described 
as an adaptive scheme based on weak measurements, with 
the system being one of two coherent states (e.g. |±a)) 
of a single-mode harmonic oscillator, of spatial duration L. 
However, because of the unique properties of coherent states, 
the system can also be thought of as a series of shorter 
modes, of length L/M, each of which is prepared in the same 
coherent state ±a/\/l\d'^. In the limit M oo, each of these 
short modes corresponds to the system that is measured at a 
particular time by the detector Thus, in this guise, the Dolinar 
receiver appears as an adaptive scheme based on projective 
measurements of sub-systems, from an ensemble of identically 
prepared systems. 

One can take this analysis further. In the M ^ oo limit 
the subsystems can be treated as qubits. The reason is that 
the mean excitation number for the adaptively-displaced states 
scales as M^^, and so have support on just the first two 
number states, |0) and which are the eigenstates of the 
measured quantity (photon number)}^ If the two undisplaced 
global coherent states are |±a), then to leading order the corre- 
sponding qubit states are |0) ± (a/\/M) and displacement 
in phase space (by a distance of order Af^^/^) is equivalent 
to a qubit rotation (by an angle of order M^^/^), and this 
rotation makes a measurement in the {|0) , |1)} basis the 
desired adaptive measurement. Moreover, the Dolinar scheme 
can then be derived as a special case of the optimal adaptive 
scheme for discriminating between two non-orthogonal qubit 
states, when one has multiple copies of them. 

This latter problem, with arbitrary pure qubit states, was 
solved by Acin et al. |10|, who gave a simple interpretation 
of the optimal scheme: it corresponds to making the optimal 
local Helstrom measurement on each copy, taking into ac- 
count the probability one assigns to each of the two possible 
preparation procedures, updated according to all the preceding 
measurements. Not only is this conceptually simple procedure 
the optimal adaptive scheme, it also reaches the Helstrom 
bound for the entire ensemble (as was known already for the 
special limit of the Dolinar protocol). That is, contrary to what 
one might have thought, an entangling measurement across 
all subsystems is not required. Note also that the restriction to 
qubits in this quantum information setting is no real restriction; 
if (unlike in the optical case) one assumes one can make 

'There is a technical issue in that, according to Dolinar's protocol, the 
displacement diverges at the initial time if the two states are initially equally 
likely. However, as demonstrated in Ref. |4|, a modified protocol with only a 
moderately large displacement obtains the great majority of the improvement 
offered by Dolinar's protocol. 



arbitrary projective measurements on a single subsystem then 
one needs only two basis states to describe a system in one 
of two possible pure states. 

The situation with mixed states is, however, much more 
complicated ifTTI . For qubits (which is now a meaningful 
restriction), even in the asymptotic Umit there is a clear 
separation in performance between 

1) the Helstrom bound (achieved by the optimal joint 
measurement); 

2) the optimal adaptive scheme (involving dynamic pro- 
gramming); 

3) the locally optimal adaptive scheme (i.e. the locally 
optimal Helstrom measurement); and 

4) the obvious non-adaptive scheme (a majority vote from 
repeated unbiased measurements). 

Moreover, surprisingly, the latter two actually reverse order 
(i.e. the locally optimal scheme performs worse than the non- 
adaptive scheme) for a sufficient degree of mixture. 

Before leaving the topic of adaptive measurements for state 
discrimination we should mention the work of Jacobs |12| on 
adaptive continuous measurements on a single qubit prepared 
in one of two non-orthogonal states \4>±). The continuous 
measurement limit can be thought of (as in the Dolinar case) 
as a sequence of weak measurements of duration At, whose 
disturbance also scales as At, and taking the limit At — > dt. In 
this case, if one measures for an infinitely long time then one 
can realize the Helstrom bound for distinguishability simply 
by making the same weak measurements at every step, of the 
Pauli operator proportional to ((/)+| — {4>-\- However 
for some finite times, more information can be obtained by 
using a locally optimized adaptive measurement in which the 
measured Pauli operator is continually rotated such that its 
expected value is zero at all times |12|. At sufficiently long 
times, this scheme always becomes worse than the simple non- 
adaptive scheme. It is also worth noting that if one considers 
the probability of error in one's guess of which preparation 
was performed, rather than the mutual information between 
one's record and the preparation, then the simple non-adaptive 
scheme always performs better Iil3i . 

B. Optical Phase Measurement 

Although the Dolinar receiver was the first notable adaptive 
measurement scheme to be introduced theoretically, it was 
not the first to be realized experimentally. That honour goes 
to the adaptive phase measurement algorithm introduced by 
Wiseman and Killip Iil4i . Iil5 1, and realized by Mabuchi 
and co-workers jSl. This technique is based on homodyne 
detection, rather than photodetection as in the Dolinar receiver 
Its aim is not state discrimination (although it can be useful 
for that 1 15 1), but rather measuring a physical quantity: the 
phase of the state. The optimal measurement to do this, 
a canonical phase measurement, would give a measurement 
result (j) with variance equal to the intrinsic phase variance of 
the state, by definition [16|. However, this cannot be realized 
by standard optical measurements. For a state with initially 
completely unknown phase, the best standard technique is 
heterodyne detection lfT6l . This introduces an excess phase 
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variance scaling as l/4fi, where n is the mean number of 
photons in the state fTTl. This scaling is known as the standard 
quantum limit (SQL). 

If one knew that the phase of the state was approximately Lp, 
one could make a homodyne measurement of the quadrature 
Xq with 9 — If + it/ 2, and this would be almost as good 
as a canonical phase measurement for many types of states]^ 
The intuitive idea of an adaptive homodyne measurement is to 
begin with homodyne measurement of a random quadrature, 
and then to adjust the local oscillator phase 9 adaptively over 
time to ifif) + 7r/2. Here Lp{t) is an estimate of the system 
phase based on the homodyne data so far. Interestingly, one 
does not want to choose (f{t) to be the best estimate of 
(j) at time t — that actually gives inferior performance for 
some states |18|. At the end of the measurement, one does 
want to chose the final estimate to be the best estimate. 
For the "Mark 11" scheme of Wiseman and Killip |14|, the 
excess phase variance scales as l/Sn'^/^, far smaller than 
that of the best non-adaptive measurement. More complicated 
adaptive schemes can do even better l(T8l . near the ultimate 
(Heisenberg) limit scaling of 

The Mark II scheme was experimentally implemented using 
small, coherently excited, microwave-frequency sidebands of a 
large coherent beam |5|. In this case the phase to be estimated 
was actually the phase of the microwave excitation, as carried 
by the optical frequency sidebands. An improvement over the 
best non-adaptive scheme was seen for mean photon numbers 
n between about 10 and 300. Because the experiment was 
performed with coherent states, it is possible to think of it (like 
the Dolinar case) as a series of projective (quadrature) mea- 
surements performed on identically prepared weak coherent 
states. However this would not be the case if the experiment 
were performed with a nonclassical state such as a squeezed 
state. In that case it would be necessary to adopt the more 
natural description, of a succession of weak measurement on 
a single mode. 

It is impossible for an adaptive homodyne measurement 
to attain the accuracy of a canonical phase measurement in 
general f21- However, there is a known exception: when the 
state has support on the |0) and |1) photon number states. This 
case was actually solved in the paper which first proposed and 
analysed adaptive phase measurement [ 17 1, and is the "Mark I" 
scheme of Refs. lfT4l . ifTSl . It has the interesting consequence 
that, given a single photon, it is possible to create deterministi- 
cally an arbitrary superposition of the states |0) and This is 
done by first creating the "mode-entangled" 1 19 1 single-photon 
state yTyjOjl) + -^1 — 77 11, 0), and then making a canonical 
phase measurement on one mode, using the Mark I scheme. 
This yields a completely random result (/> (which emphasizes 
that in this case there is no approximate initial phase that a 
nonadaptive homodyne scheme could use), and collapses the 
second mode into the state y^e"^ |0) + ^/l — 77 Since 
is known, the phase of this superposition can then be adjusted 
to the desired phase. 

The ability to create arbitrary superpositions of |0) and 

^The 7r/2 phase difference between d and here follows from a convention 
regarding the phase introduced by a beam-splitter, which is generally used in 
papers on this topic, e.g. Refs. Q3, (l4l, (T5|, (T^, (5|. 



|1> ^(|1>|0> + |0>|1» ^(e"*|l>|0>+e-^*|0>|l)) 
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Fig. 1. Conceptual diagram of the generalized QPEA implemented using 
a Mach-Zehnder interferometer, shown with quantum states (expressed in 
the photon-number basis) at key points. The large phase-shift element is 
configured to implement an adjustable number p of cii-phase shifts on photons 
passing through the upper arm (in this example, p = 8). The small phase-shift 
element implements an adjustable p9 phase shift on photons passing through 
the lower arm. The output of the single photon detectors determines, via the 
processor, how to adjust 6 prior to the next photon input, and also the final 
phase estimate <j>cst- 

|1) photon states is of course the ability to create arbitrary 
photonic qubit states using "single-rail" logic |20|. It turns out 
that adaptive phase measurements can also be applied at other 
points in Unear optics quantum computing to make it possible 
to use single-rail encoding with resources that are far smaller 
than was previously thought possible, and not substantially 
larger than the resources required for conventional "dual-rail" 
encoding fT\\. 

It is worth noting here that even a "conventional" linear 
optics quantum computation can be regarded as an enormous 
adaptive measurement protocol, since it involves making mea- 
surements on subsystems (single photons) of an entangled 
state, and manipulating the remainder of the photons in a 
way that depends upon the results of the prior measurements, 
before a final measurement that reveals the result of the com- 
putation. This is true of both the circuit architecture and the 
cluster-state architecture for LOQC |20l. Indeed the 2007 four- 
photon cluster-state experiment from the group of Zeilinger 
|22J is perhaps the first adaptive measurement in which the 
subsystems were entangled prior to the measurement. 

C. Interferometric Phase Estimation 

While adaptive optical phase measurement is potentially 
useful for quantum computing, the third and final application 
we consider is linked to quantum computation in the opposite 
way. That is, it uses an algorithm from quantum computing 
theory to inspire new adaptive protocols. Specifically, based 
on the quantum phase estimation algorithm (QPEA) of Cleve 
et al. II23I . II24I . we have devised a new family of adaptive 
protocols for interferometric phase estimation |6|. Moreover, 
we have implemented one of these algorithms, demonstrating 
Heisenberg-limited scaling for phase estimation for the first 
time IS). 
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The bulk of this paper is dedicated to explaining adaptive 
(and non-adaptive) measurements for interferometric phase 
estimation from a quantum information perspective. In this 
section we will concentrate on placing it in the context of 
optical interferometry (although the idea could work equally 
well with particles — such as neutrons |25| — other than 
photons). A conceptual experimental diagram is shown in 
Fig. [T] The key differences from a standard Mach-Zehnder 
interferometer are 

1) The number of passes of one arm through the unknown 
phase shift can be greater than one, and is assumed 
controllable over the course of the experiment. 

2) The phase shift in the other arm is assumed controllable 
over the course of the experiment. 

In standard interferometry, N indepedendent photon- 
detections allow the unknown phase to be estimated with 
accuracy A(j) = I/Vn (for large N), which is known as the 
standard quantum limit (SQL). By contrast, the Heisenberg 
limit (HL) is quadratically better: = tt/N. 

Those familiar with the QPEA may be surprised to find 
that applying a generalization of this algorithm to the Mach- 
Zehnder interferometer (as will be explored in later sections) 
yields only a quadratic improvement in the accuracy. The 
QPEA is supposed to give a binary phase readout, implying 
an uncertainty exponentially small in the size of the register 
Also, it is at the heart of Shor's algorithm f26l which gives an 
exponential speed-up over classical algorithms for factoring. 
In quantum computing theory, the QPEA is used to estimate 
the phase of an eigenvalue e'*^ of a (typically multi-qubit) 
unitary operator U that corresponds to some calculation. If 
one can do a quantum computation implementing U, then one 
can do (more or less) with the same resource cost a quantum 
computation implementing U^, for any p. Thus in a quantum 
algorithm context, the number of "passes" p is irrelevant. 

If we were to follow an analogous method of resource 
counting in interferometry — count simply the number of 
photons irrespective of the number of times p each photon 
passes through the unknown phase shift — then the algo- 
rithm implemented in Ref. |6| would have yielded a 
exponentially small in N. This violates the long-established 
Heisenberg limit scaHng fTP\, f8] with A0 of order l/N. 
The (correct) Heisenberg scaling is obtained by counting not 
photons, but rather the total number of photon-passes through 
the unknown phase shift |28| . That is, one should count each 
photon involving a p-pass interferometer (as shown in Fig. [T]) 
as using p resources, not 1. 

This method of counting is not justified merely by giving 
the expected Heisenberg-limit. Rather, it has a number of other 
justifications. First, photon number is not a sensible resource. 
For example, if one allows for non-demolition photon-number 
measurements, one could repeat a standard interferometry 
experiment arbitrarily many times, thus obtaining an arbitrarily 
small uncertainty, using just a single photon, as shown in 
Fig. [2] Other nonlinear optical processes allow for scenarios 
where it is not even clear how to define the number of photons 
||29i . Second, one practical reason for caring about photon 
number is if one has a sample that is extremely sensitive to 
light. In this situation what is relevant is clearly the number 




Fig. 2. Arbitrarily accurate pliase estimation may be done with a single 
photon if QND measurements are allowed. After detection, the photon 
(wherever it is found) is redirected back into the interferometer. 



of photon-passes through the sample, not anything else. Third, 
if one counted photons rather than photon-passes, this would 
ignore the extra time it takes for a photon to make p passes, 
rather than 1 pass, through the sample. In the asymptotic 
regime of arbitrarily large N (and hence large p), this is 
necessarily the time that will determine the duration of the 
experiment. 

It might be thought that the technique of using p passes 
of a single photon is not really a measurement of 4> at all, 
but rather a measurement of pcj). From this one would of 
course expect a sensitivity in (j> that scales as p from a single 
photon measurement, and so an overall uncertainty in (f> scaling 
inversely with the total number of photon passes is just as 
expected. This works only if one already knows the phase 
approximately, and the deviations one is trying to detect are 
much smaller than 27r/p. This is quite different from the 
fundamental problem of estimating a completely unknown 
phase 0, which is the task to which the Heisenberg limit 
pertains 1271 . HI. In the latter case, one cannot simply set 
p to be as large as it can be, and then measure pcj), because 
this would yield information only about modulo 2tt/p. 
Rather, even if one sets p to high values at some stage of 
the experiment, at some other stage(s) it must be set to one 
in order to pin down a single value for </> within the range 
[0,27r). That is to say, for the task we are interested in, the 
measured quantity is the variable </> modulo 2tt, not the variable 
pcj) modulo 27r. The optimal way to vary p over the course of 
the experiment is then the crucial issue and the QPEA suggests 
an answer (or at least a starting point |6l.) 

In many practical examples, the strategy of fixing p at a 
value as large as possible works because one is trying to detect 
small changes to an already known phase. This is the case in 
gravitational wave interferometry, and atomic clocks. In the 
former case, the maximum number of passes is set by the 
geometry of the experiment and the quality of the mirrors, 
and this determines the accuracy 130]. In the latter case, 
the decoherence time of the atoms determines the maximum 
number of "passes" (Rabi cycles) the atoms can undergo, and 
the standard quantum limit is determined by this, and the 
number of atoms in the sample 131] . In both of these cases 
one is interested in minimizing the error, and hence one would 
ideally measure pcj) with a large ensemble Af of identical 
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Fig. 3. Single-pass Mach-Zehnder interferometer with arbitrary inputs and 

arbitrary measurements. It is known that this can achieve the Heisenberg-limit pig. 4. Realising a multipass Mach-Zehnder interferometer from a single- 
A4> ~ Tv/N for an A^-photon input state. pass (inside the round-cornered box) by recycling the outputs into the 

inputs. Note that this is done coherently unlike in Fig. |2] 



systems (photons or atoms). Thus the overall scaling for the 
uncertainty would be like Af That is, one does not get 

scaling at the Heisenberg limit if one fixes p and sets N = Mp. 
In practice one cannot even increase M without limit in these 
experiments. For gravitational wave interferometry, there is 
an optimal number of photons for which the noises from 
photon counting and from radiation pressure balance ll30l . 
For atomic clocks, increasing the cloud density similarly leads 
to collisional energy shifts (i.e. phase noise). In summary, in 
both of these cases the limits are set by practical, rather than 
fundamental, considerations. 

It should be noted that the earlier remarks relating to 
measuring modulo 27r/p apply to NOON states with N — p 
in exactly the same way as they do to a single photon with 
p-passes. A NOON state is a state of the form |iV, 0) + |0, N) 
(expressed using the number basis for the two arms of the 
interferometer shown in Fig. |3] ll32l ). For large p, an N — p 
NOON state has an advantage over a single photon with p- 
passes in that the time it takes to pass through the interfer- 
ometer does not scale with p. It has the obvious disadvantage 
of being far more difficult to produce — thus far = 4 
is the largest NOON state demonstrated (and that only in a 
post-selected sense) f33l. It is also far more difficult to detect 
— it requires photon-number-resolving detectors with loss 
smaller than 0{1/N). If it were possible to generate and detect 
"high-NOON" states then the algorithms here could be applied 
directly to that case as well. The equivalence comes from the 
fact that all of the phase information in a NOON state after 
it has passed through the final beam-splitter is contained in 
the parity of the photon number at one detector (which is why 
absurdly high efficiencies are required). Note that although the 
multipass technique can tolerate detector losses of order unity, 
it is sensitive to absorption within the sample in precisely the 
same way as is the NOON state technique. 

From the above comparison between NOON states and 
multiple passes of a single photon, one might argue that there 
are really two resources: N (the number of photon passes in 
both cases); and T, the time taken for the experiment. In the 
limit N oo, the duration T for the NOON-state protocol 
increases only logarithmically with N (as explained in the 
following sections), and for an optimal multiphoton entangled 



state (see Sec. |III-C| l, T need not scale with N at all. In 
the multi-pass case, by contrast, T necessarily scales as N, 
as discussed above. In saying that our technique attains the 
Heisenberg limit, we mean only that it reaches the minimum 
possible phase variance for a given N. That is, to obtain a 
phase estimate with a binary expansion of logj (N) bits, with 
an error in only the least significant bit, we are prepared 
to accept an experiment of exponentially (in log2 N) long 
duration. This can be avoided only by moving the exponential 
cost associated with N from the time to the state itself — 
the NOON state (and its relatives) contain exponentially many 
photons and hence contain an exponentially large amount of 
energy. In practice, photons are fast and small, so N would 
have to become very large for either time or energy to become 
importanj^ On the other hand, the NOON-state protocol also 
requires an exponentially efficient detector. As noted above, 
one requires 1 — 77 to be exponentially (in log2 N) small, 
in contrast to the best current detectors where 1 — 77 barely 
qualifies as "small" at all. Thus it is hard to see why the 
resource T should be regarded as more fundamental than these 
other resources. 

Finally, it might be thought that a multipass interferometer 
is somehow changing the rules of the game, in a way that using 
a NOON state is not. That is, one might argue that the rules 
allow arbitrary preparation as inputs to the interferometer, and 
arbitrary processing of the output, as shown in Fig. |3] but not 
changing of the beam-paths through the sample. However it is 
simple to bypass this objection. If arbitrary preparations and 
measurements are allowed, then one allowed scheme is not to 
measure the output modes, but rather simply to redirect them 
back as the input modes as shown in Fig. |4] The input beam- 
splitter simply undoes the effect of the output beam-splitter, 
and the second passage of the photon through the unknown 
phase shift is exactly as in a two-pass interferometer This 
can be repeated as many times as desired to give a multipass 
interferometer. 

In Ref. [6 1 we implemented a generalized QPEA using 
multi-passed single photons with a common-spatial-mode po- 

^This of course ignores the difficulty of converting Nhuj of energy into a 
NOON state, which is clearly enormous for lai'ge A'^, but hard to quantify. 
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Experimental data, adaptive M = 6 
Numerical calculation, adaptive M = 6 
Experimental data, adaptive M = 1 (QPEA) 
Analytic calculation, adaptive M = 1 (QPEA) 
Experimental data, standard single pass 
Standard Quantum Limit (numerical calc) 
Heisenberg limit (analytic calculation) 
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Fig. 5. Experimental results from Ref. f6j, of standard deviations of 
phase estimates for varying numbers of resources A^. We compare theoretical 
predictions (lines) and measured values (points, each representing 1,000 
estimates) for standard phase estimation, the QPEA {M = 1), and our 
generalized QPEA (A/ = 6) algorithms. Error bars denote 95% confidence 
intervals. Our algorithm clearly gives better phase estimates than both the 
SQL and the QPEA limit. 



larization interferometer, which is more stable than a Mach- 
Zehnder interferometer. The two arms of the interferometer 
were the right-circular and left-circular polarization modes, 
and the unknown phase (p was implemented as the orientation 
angle of a birefringent half-wave plate. The number of passes 
was varied from 32 to 1, in powers of 2, using electro- 
mechanical devices. We implemented standard interferometry 
(verifying the SQL), the QPEA (surprisingly also scaling as 
the SQL, as will be discussed below), and a generalized 
QPEA in which M = 6 photons (rather than M = 1 as 
in the QPEA) were sent down consecutively for each pass- 
configuration. This last technique gave a scaling the same 
as the Heisenberg limit, with a multiplicative overhead of only 
1.56, as predicted by theory. See Fig. |5] 

in. Quantum Limits to Phase Estimation 
A. Rules and Representations 

In this and following sections we will analyse phase es- 
timation algorithms (adaptive and otherwise) from a purely 
quantum information perspective. That is, we consider general 
qubits rather than photons. In this context, the rules of the 
game are as follows: 

1) We have a gate that performs the unitary operation 
exp{i(j)\l) (1|) on a specific sort of qubit. 

2) We have an indefinite supply of these qubits. 

3) The parameter (fi is initially completely unknown. 

4) We are allowed only N applications of the gate. 

5) We aim to minimize the variance in our best estimate 

</>ost of (j). 

Technically, we use the Holevo variance measure, Vh = 
(exp[i((/) — 0est)])~^ — 1 l34l . |[T4l . as this respects the cyclic 
nature of phase. The variance is the most robust figure of merit, 
in that if it scales well then all other measures will also scale 
well, but not vice versa 



In quantum information language, a photon entering one 
port of the interferometer is represented by preparation of 
the qubit in (say) the logical |0) state. The initial beam 
splitter acts as a Hadamard transform H, yielding H \0) = 
(|0) + |1)) /\/2 = |+). The unknown phase shift in the upper 
arm is represented by the unitary operator exp(i0|l) 
while the known phase shift in the lower arm is represented 
by exp(i6' |0) (0|). The final beam-splitter again transforms 
from the logical (Z) basis to the |±) basis (the X basis). 
Thus if the photonic qubit is measured then this amounts to a 
measurement of X prior to the final beam splitter. Estimating 
the phase on the basis of passing a single photon once through 
the interferometer, and measuring it, is therefore described by 
the circuit 



H 




gj(?|0)(0| 







For compact notation in later circuit diagrams, and also to 
connect more closely to the QPEA of Shor's algorithm, we 
change to a representation where exp(i0 |1) (1|) is represented 
by the controlled-unitai-y gate \l) {1\<S)U + |0) (0| (K) /, where 
U acts on a state \(f)) (which could have any Hilbert space 
dimension) with U \(j)) — e*'* and / is the identity operator 
on this space. We also treat the auxiliary phase 6* as a real- 
number-valued classical register, which controls (indicated by 
a ♦ symbol) the gate R{9) = exp{id |0) (0|). Thus we rewrite 
the above circuit as 




Here if we take 9 to be random (but known), this ensures an 
estimate 0cst with an accuracy which is independent of the 
true value (f>. Here (f)cBt is shifted (indicated by the gate D) 
from 9 by an amount 59 depending in someway (indicated by 
0) on the classical result. The advantage of treating 9 in this 
way will become apparent. 

B. The Standard Quantum Limit 

The standard quantum limit pertains when we simply repeat 
the above single-qubit circuit TV times. That is, we have N 
qubits, independently prepared, independently measured, and 
with exp(i(/)|l) (1|) applied once on each (p — 1). To ensure 
uniform sampling, ^init is random, and 9 is incremented by 
Ti/N between one qubit and the next. The case = 4 is 
shown in Fig. |6] This yields the standard quantum limit (SQL) 
for accuracy, given by 



SQL = F[0est] - l/N for iV > L 



(1) 



We can also allow for 9 to be controlled adaptively between 
one qubit and the next, as shown in Fig. |7] The obvious 
procedure, suggested in Ref. p36l, is to choose 9 so as to 
minimize the expected variance after the measurement whose 
result is influenced by this new 9, which entails averaging 
over the two possible results of the measurement. This local 
optimization gives a slightly more accurate measurement for 
small N, but makes no difference asymptotically ll35l . A 
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Fig. 6. Circuit representation of "standard" interferometry with a controllable auxiliary phase. This defines the SQL. Here N = 4. 
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Fig. 7. Circuit representation of "standard" interferometry with an adaptively controlled auxilliary phase. The adaptation does not improve the asymptotic 
accuracy beyond the SQL. 



global optimization does even better for small N, but again 
essentially no difference for large N 



C. The Heisenberg Limit 

With no restrictions on how the qubits are prepared or 
measured, an obvious approach is to use N qubits, prepared 
in a suitable entangled state, and measured by a suitable 
entangling measurement. In this case the initial and final 
beam-splitters (Hadamards) can be absorbed into the state 
preparation and measurement, and so are irrelevant to the 
problem. So too is the auxilliary phase. Each qubit controls 
the phase gate once. For example, with iV = 3 we have: 



Ent. State Prep. 



r 

u \u\\u 



Ent. Meas. 



Although the phase gates are shown as acting sequentially, 
they can be imagined to act simultaneously. This circuit is 
equivalent to Fig. [3] where arbitrary preparation and mea- 
surement allows the interferometer to achieve its ultimate 
performance, the Heisenberg limit, which is given by |[36l 



HL = V^[0est] = tan^ 



N + 2 



(2) 



This is attained by using the canonical phase measurement, 
described by phase states 



N 



|a>)(x^e^*"|n,7V-n)s, 



(3) 



and the optimal input state ll37l . Il36l . 



JV 



1^, 



opt/ oc > sm 

n=0 



(n+ l)7r 
N + 2 



\n,N - 



(4) 



Here |n,7V — n)g is a symmetrized state in which n qubits 
are in state |1) and TV — n in state |0). (For identical and 
indistinguishable bosons such as photons, in two modes, this 
symmetry is enforced by the quantum statistics.) Note that 
this entangled state differs from the NOON state, which has 
the form |iV, 0)g + |0, iV)g in this notation. The NOON state 
gives the maximum Fisher information (equal to N) from a 
single measurement, but has an appalling variance (the Holevo 
variance is in fact infinite) because it only detects changes in (f) 
modulo 2tt/N . The optimal state IV'opt) has only a moderately 
smaller Fisher information — about 0.36iV Ii35i — and the 
minimum variance. 

In the asymptotic limit. 



HL = y[(/.est] ~ (7r/iV)' for TV » 1, 



(5) 



11=0 



which is quadratically better than the SQL. The Heisenberg- 
limit scaling implies that to obtain K + 1 bits of precision 
for 0ost, we require of order N ~ 2^^^^ qubits. That is, it is 
exponentially costly in "spatial" resources^ However this is 
not necessary. The fact that the optimal state and measurement 
can be written using a symmetrized basis with only + 1 
basis states (out of a total Hilbert space dimension of 2^ 
for N qubits) allows an alternate representation. Assuming 
N = 2^~^^ — 1, we can use just K + 1 qubits, and define a 
new \n) as the logical state of a register of qubits with n a 

*In the context of optical interferometry, this corresponds to the exponential 
energy cost of NOON-like states discussed in Sec. |II-C| 
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binary string of length A' + 1. The price to be paid is that the 
evolution generated by the phase gates, 



N 

E 



sm 



N + 2 



N 

E 

n=0 



e""* sin 



(n + l)7r' 



(6) 

now requires an exponential number of phase shifts on the 
"most significant qubits" in the binary representation of n. 
That is, we have swapped an exponential cost in spatial 
resources for an exponential cost in time resources, if each 
phase gate is assumed to take a fixed time. 

The canonical phase measurement, described by the phase 
states ([3]), is a measurement in a basis conjugate to the logical 
basis. The transformation from one basis to the other is exactly 
what is achieved by the quantum Fourier transform ll26l . 
Thus using the binary representation rather than the symmetric 
representation, the Heisenberg limit can be achieved by the 
circuit shown in Fig. [8] In this instance, there are K + 1 = 3 
qubits, but N = 2^+^ — 1=7. The estimate 0ost is read- 
out from the results of Z measurements as shown, and we 
are using the notation r = [r]a.[r]i[r]2 ■ ■ ■■ The kth qubit 
(fc — 0,1,--- ,K) "passes" the phase gate 2*^ times. As 
stressed above, even though we represent (for instance) 4 
applications of the phase gate by a single controlled- [/* gate, 
this must be regarded as using 4 resources. 

It is a remarkable fact, first pointed out by Griffiths and 
Niu [38 1, that the QFT^^ can be achieved by local (i.e. 
single-qubit) measurement and control. This can be seen by 
moving the measurements back through the QFT^^ and the 
controlled-phase gates, using the gate commutation properties. 
The control is often called feed-forward, but since each 
controlled qubit and measured qubit are entangled prior to 
the measurement, the control is arguably feedback based on a 
partial measurement of a multi-qubit system. Indeed, we will 
use this terminology even when the qubits are independently 
prepared, because they are still correlated (from the point of 
view of the experimenter) due to the action of the phase gate 
with unknown phase (j>. In any case, this adaptive scheme 
makes the measurement component of this Heisenberg-limited 
protocol far easier to implement experimentally. See Fig. |9] 

IV. The Quantum Phase Estimation Algorithm 

Although the Griffiths-Niu technique makes the measure- 
ment easy to implement, attaining the exact Heisenberg limit 
still requires creating a multi-qubit entangled state, which is 
hard. This suggests exploring what happens if we replace 
the entangled state by independent qubits as in the standard 
scheme. This yields the quantum phase estimation algorithm 
(QPEA) 1231, 1241. If we mediate the control steps via the 
auxilliary phase 0, and introduce a random 0i„it to ensure 
equal accuracy for all 0, then the QPEA is represented by the 



circuit in Fig. 10 This looks almost identical to the adaptive 



version of the standard protocol, as shown in Fig [7] The 
difference is in the multiple gate applications on a single 
qubit; the QPEA with K + 1 register qubits again uses 

N ^ 2^ + 2^^"i H h 1 = 2^+1 - 1 resources, whereas 

the standard scheme uses K + 1 resources. 



Since the QPEA gives K+ 1 bits of ^cst/Tr, and N - 2^+^ 
would we not expect 

QPEA : V[q),st] oc ^ {n/Nf (the HL) ? (7) 

Contrary to this expectation, an exact calculation ||6l, ||39]| 

gives 



QPEA : V[(t>est] - 2/iV cx 1/iV (the SQL). 



(8) 



So what went wrong? Why does the algorithm not only fail 
to attain the Heisenberg limit, but actually do worse than the 
SQL? The short answer is: Outliers. The distribution P{(f>cst) 
is sharply peaked around 0. The half-width at half-maximum 
height is given by 



QPEA : (HWHM)2 ~ (2.81/7V)2 



(9) 



But the distribution has high wings, giving SQL scaling for 
the variance. Specifically 



QPEA 



sin2[iV((50/2)] 
Nsm'^[6(f>/2] ' 



(10) 



where 6<j) = (pest — In the wings, Pqpea(<J0) has an 
envelope that falls only like (Scj))^'^. This is a consequence 
of the fact that we are using not the optimal state (|4]) with a 
wide but smooth number distribution, but rather a state with 
a "flat" number distribution. 



N 



(11) 



It is the sharp cut-off of the number coefficients that leads to 
the poor localization of phase, the conjugate variable. 

Although outliers in (pest are important for phase estimation 
(where our figure of merit is the variance), they are not 
important for quantum computing applications. There, all one 
cares about is getting the right answer (to the number of bits of 
precision one has) from the algorithm with some reasonably 
high probability, and the QPEA works fine for this purpose 
1231 , l24 j. Indeed, if (p/n had an exact binary expansion in 
K + 1 bits, then we could remove the random 6'init, and the 
QPEA with K + 1 qubits would be guaranteed to find (p 
exactly; the variance would be zero. While this assumption 
may be relevant in some quantum computing applications, it 
is contrary to the rules of the game (Sec. Ill A) for phase 
estimation. 

V. The Generalized QPEA 

One way to understand the high wings of the QPEA 
distribution is that if an error in an insignificant bit occurs, 
it propagates upwards into the more significant bits through 
the feedback protocol. This suggests a way to remove such 
errors: by repeating each measurement some number M times, 
as previously suggested in various settings |40l , BTl . l42l . 
128^. Recall that in the QPEA the A:th qubit (fc = 0, 1, • • • ,K) 
controls the phase gate 2'^ times. We generalize this by having 
M qubits for each 2'^ -fold application, so that the total number 
of passes through of the phase gate is iV = M x (2^+^ — 1). 

With this generalization it is no longer clear how to change 
the auxiliary phase 9 between measurements. Giovannetti, 
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Fig. 8. Circuit representation of Heisenberg-Iimited interferometry using a binary encoding. Here tlie resource count isAf = 4 + 2 + l = 7. 
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Fig. 9. Circuit representation of Heisenberg-Iimited interferometry as in Fig. H] but using the adaptive measurement scheme of Griffiths and Niu. 
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Fig. 10. The Quantum Phase Estimation Algorithm for K = 3, so that N = 7. 



u 



D{5e) 




Fig. 11. Our generalized QPEA for the case M = 2 and K = 1 (so that = 6). 



Lloyd, and Maccone f28l, considering the same problem as 
here, imply (when discussing a NOON-state realization rather 
than the equivalent binary-encoding implementation) that the 
adaptation of 9 is unnecessary; we will return to this point in 
Sec. VI We prefer to keep the adaptation of 9 because it is 
an integral part of the QPEA. We use the adaptive algorithm 
of Ref. f36l to make the locally optimal measurement, as 
explained in Sec. 



scaling at the SQL. (In this it is similar to the heterodyne 



m-B 



For M = 1 this exactly reproduces 
the QPEA, which is why we regard this family of algorithms 
(parametrized by M) as the natural generalization of the 
QPEA. 

For M > 1 our generalized algorithm no longer realizes an 
optimal phase measurement; the Griffiths and Niu trick of real- 
ising an optimal phase measurement by local measurement and 
control only works for a single copy of the quantum register 
with binary-encoded phase. In fact, for M = 2 as illustrated 
in Fig. 11 the measurement is far worse than optimal. As 
we have shown analytically (|6l, ||39l . the measurement in this 
case introduces so much noise that the estimate has a variance 



measurement of Sec. II-B ) This is so despite the fact that 
the relevant state in this case, IV'flat)^^. would give a nearly 
Heisenberg-Iimited phase estimate if one could implement an 
optimal measurement. For Af — 3, numerical simulations 
show that the non-optimal measurement introduces an excess 
noise variance with scaling consistent with N^^^^, interme- 
diate between the SQL and the Heisenberg limit. (In this it is 
similar to the Mark 11 measurement of Sec. |II-B| ) For M > 4, 
numerical simulations show that the measurement allows, as 
for the M = 1 cas^ estimation with accuracy scaling at 
the Heisenberg limit. All of these numerical simulations were 
performed up to iV > 10^, far into the asymptotic regime. 

If one increased M indefinitely, with K fixed, one could not 
hope to achieve Heisenberg-Umited scahng. Intuitively, this is 
because the Heisenberg-Iimited sensitivity comes from having 
a maximum number of passes p — 2^ which scales linearly 

^Heisenberg-Iimited accuracy is not attained in the M = 1 case because 
the prepared state does not have a Heisenberg-Iimited phase variance. 
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Fig. 12. Non-adaptive algorithm with M (K, k) = Mk + fJ.{K - k). Here K = I, Mk = 1, and /.t = 2, so that = 5. 



with N. Alternatively, it can be understood from the fact that 
the State \ipfiat)^^^ ceases to have a broad number distribution 
(and hence a narrow phase distribution) as M increases. In fact 
it can be shown analytically that for large M, the multiplicative 
overhead above the Heisenberg limit increases like M. Thus 
there must be an optimal value of M equal to 4 or higher 
Numerically we found the best results to be for M — 5 |,39J: 



A/ = 5 GQPEA : V[(f>,st] ^ (4.8/7V)^ 



(12) 



compared to {tt/NY for the Heisenberg limit. That is, even 
though the state IV'flat)'^^ is not the optimal state, and the 
locally optimal adaptive measurement is not a canonical phase 
measurement, the multiplicative overhead on is less than 
1.53. Although theoretically M = 5 is the optimal choice for 
minimizing the variance, it still yields more outliers than is the 
case for larger M. Experimentally this means not that it is hard 
to measure with a precision scaling at the Heisenberg limit, 
but rather that it is hard to prove this precision because of 
the difficulty of obtaining reliable statistics. For this reason, 
in Ref. 161 we experimentally implemented the generalized 
QPEA for M — for which the multiplicative overhead on 
A(/) is about 1.56. 

VI. Are Adaptive Algorithms Necessary? 
A. A Less Adaptive "Hybrid" Algorithm 



Our Generalized QPEA (Fig. Ill involves Bayesian feed- 
back, in which every past result contributes to determining 
the auxilliary phase increment 50, according to the Berry- 
Wiseman protocol |36|. This is to be contrasted with the orig- 



inal QPEA (Fig. 10 1 in which only the immediately preceding 
result is required. As noted above, the QPEA would achieve 
the Heisenberg limit if it were not for outliers. This suggests 
that we could get to the HL more simply by augmenting 
the (simple adaptive) QPEA with the (non-adaptive) SQL 
algorithm to remove outliers. The reasoning is that the non- 
adaptive SQL algorithm, consisting basically of a large number 
of indepedent trials, would be expected to have a probability 
distribution for the estimate which is roughly Gaussian, and 
so would have exponentially suppressed wings, and as long 
as a reasonable fraction of the resources is still devoted to the 
QPEA, then its narrow peak would hopefully remain intact. 

Investigating this hybrid scheme anaytically and numeri- 
cally, reveals that the optimal division of resources is {2/'i)N 



for the QPEA and (1/3)A^ for the standard scheme 
However, contrary to expectation, we find that 

Hybrid : y[0est] = w{N)/N'"^, 



(13) 



where w{N) is a function that increases very slowly with N 
(from 4.83 at TV = 5 to 6.17 at TV = 767) (29]. That is, 
the hybrid scheme delivers a scaling intermediate between the 
SQL and the Heisenberg limit. Nevertheless it is interesting 
that by combining two measurement schemes, both of which 
give a variance scaling at the SQL, one obtains a much better 
scaling. 

B. A Non-Adaptive Local Algorithm 

The above hybrid algorithm is mostly non-adaptive — 
it has only K = 0{logN) adaptive measurements out of 
0{N) measurements total. As noted, it surpasses the SQL in 
accuracy. In it, the number of qubits with 2*^ passes through the 
phase gate is M{K, k) = \ + Skfi'i^ ■ This raises the question: 
can one get to the Heisenberg limit with no feedback by choos- 
ing a smoother function M{K, k) that still (like the hybrid 
scheme) assigns more qubits to smaller fc-values? Attacking 
this question analytically suggests considering functions of the 
form |i29J 

M{K,k) = MK+[lJ.{K-k)\, (14) 

where /i is a positive constant. For each value of k, the 
AI{K,k) qubits are measured independently, using the aux- 
illiary phase 9 only to ensure an unbiased measurement. 
For example, it can be incremented by Tr/M{K,k). This is 
illustrated in Fig. 12 with Mk = 1 and fi — 2. 

Numerically we find the best results to be for Mk = 2 and 
^ = 3 



NALA: y[0est] ^ (6.4/iV)2 



(15) 



That is, the overhead on is less than 2.03. This is not 
greatly bigger than the lowest known overhead for an adaptive 
scheme, of 1.53 for the M — 5 GQPEA. It is important to 
note that numerical simulations for a nonadaptive scheme with 
M fixed do not show scaling at the Heisenberg limit. This is 
contrary to the claim (rather casually made) in Ref. [28il that 
the Heisenberg limit for the variance can be attained from 
measuring each bit in the binary expansion of </> by making 
nonadaptive measurements (their ly is our M) for each bit, 
and then taking "the limit of large v". It is not clear what this 
limit is meant to be; certainly cannot be arbitrarily large as 
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this would lead back to the SQL as discussed in Sec. V ^ Our 
numerics show that nonadaptive measurements with a large 
but fixed M seem to give a scaling close to the Heisenberg 
limit up to a point, but as N is increased (by increasing K) 
beyond that point the variance ceases to scale as 7V~^. 
In additional to this numerical evidence, the above algorithm 



using M{K,k) of the form ( 14 1 is the only known nonadaptive 



algorithm on single qubits which has been rigorously proven 
to attain the Heisenberg limit. The proof ll29ll . f39J involves 
values of Mk and fi that are known (from numerics) not to 
be optimal, but which allow one to rigorously bound all the 
contributions to the variance, using Chernoff's theorem. To 
apply this theorem one needs repeated identical measurements, 
so the proof assumes using just two values of 9: and 7r/2. 
Specifically, for M{K,k) = 18 + [161n(2) x (X - k)\ we 
prove that 



NALA: l/[0est] < (150/iV)^ 



(16) 



The large overhead in this case shows that one would not 
want to use these parameters in practice, but it does prove 
rigorously that adaptive measurements are not necessary to 
attain the Heisenberg limit in interferometry using only single- 
qubit preparation and measurement. 

VII. Conclusion 

Recent years have seen the addition of adaptive measure- 
ments to the arsenal of techniques used in quantum optics 
information laboratories to probe the quantum world. The 
notable protocols which we have discussed here are: the 
Dolinar receiver |7J, realised in Ref. Q; the Mark II phase 
measurement of Refs. lfT4l . ifTSll . realised in Ref. ||5l; and 
the generalized quantum phase estimation algorithm proposed 
and realized in Ref. |6|. These protocols are all intimately 
related to quantum information, through quantum communi- 
cation, quantum computation, and quantum algorithm theory 
respectively. In this context, the recent quantum computing 
cluster-state experiment of Ref. Il22l should also be mentioned, 
as perhaps the first adaptive measurement in which the adapted 
measurement was performed on a subsystem that was initially 
entangled with the first-measured subsytem. 

The bulk of this paper has concentrated on the work of 
Ref. |2J, which is unique in that the adaptive measurement 
protocol was inspired by quantum algorithm theory, but serves 
a purpose quite different from quantum computing, namely 
estimating a completely unknown phase shift </> in one arm 
of an interferometer with a fixed number of photon-passes 
through the interferometer. We analysed this from a quantum 
information perspective (e.g. replacing photons by qubits and 
phase shifts by controUed-unitaries) to make the connection to 
quantum algorithm theory as explicit as possible. 

^Giovannetti et al. |28| perhaps miss this point because they appear 
to ignore the u repetitions in their count of the resources, in deriving 
"the Heisenberg limit [as] the ultimate bound to precision in phase 

measurements." Indeed they call N the "number of probes" (i.e. register 
qubits), rather than the number of applications of the phase gate. This is 
despite the fact that they also say that "Instead of a parallel strategy on N 
probes, one can employ a sequential strategy on a single probe," while at 
the same time saying that "one finds the same 1/N precision scaling ...for 
sequential strategies." 



To attain exactly the Heisenberg limit for the variance of 
the estimate, y[(/)ost] ~ (i"/^)^^ the most efficient (in terms 
of minimizing the number of qubits in the register, and the 
number of entangling gates performed on the register) protocol 
requires all of the following: 

1) preparing an entangled state of O(logA^) qubits. 

2) multiple applications of the controlled-unitary gate by 
any given qubit. 

3) adaptive measurements (control of individual qubits 
based on prior results). 

We have shown numerically, and experimentally in Ref. f6l, 
that using a generalized quantum phase estimation algorithm 
one can dispense with the entangled state preparation, and 
still achieve Heisenberg-limited scaling, and indeed come very 
close to the Heisenberg limit: 



GQPEA : l/[0est] - (1.537r/Ar)2 



(17) 



Given that it is impossible, using only single-qubit measure- 
ment and controls, to produce simultaneously the optimal state 
and the optimal measurement, our phase estimation algorithm 
must be close to the best achievable with this restriction. 

Finally, we briefly discussed some unpublished results ||29l , 
||39l showing analytically and numerically that one can achieve 
Heisenberg-limited scaling, although with an increased over- 
head, even without the adaptive measurements. However this 
requires a sophisticated partitioning of resources, contrary to 
some claims in the literature (e.g. Ref. I.28J ). 
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